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Two types of supernova are thought to produce the overwhelming majority of neutron stars 
in the Universe^ The first type, iron-core collapse supernovae, occurs when a high- mass star 
develops a degenerate iron core that exceeds the Chandrasekhar limit^. The second type, 
electron-capture supernovae, is associated with the collapse of a lower-mass oxygen-neon- 
magnesium core as it loses pressure support owing to the sudden capture of electrons by 
neon and/or magnesium nuclei^ It has hitherto been impossible to identify the two distinct 
families of neutron stars produced in these formation channels. Here we report that a large, 
well-known class of neutron-star-hosting X-ray pulsars is actually composed of two distinct 
sub-populations with different characteristic spin periods, orbital periods and orbital eccen- 
tricities. This class, the Be/X-ray binaries, contains neutron stars that accrete material from 
a more massive companion stai^. The two sub-populations are most probably associated with 
the two distinct types of neutron-star-forming supernovae, with electron-capture supernovae 
preferentially producing system with short spin period, short orbital periods and low eccen- 
tricity. Intriguingly, the split between the two sub-populations is clearest in the distribution 
of the logarithm of spin period, a result that had not been predicted and which still remains 
to be explained. 

Be/X-ray binaries (BeXs) are strong X-ray sources because their neutron stars accrete mate- 
rial at a relatively high rate. Their mass-losing Be-type companions are fast-rotating 8Mq — ISM© 
main-sequence stars that are surrounded by circumstellar "decretion disks". These disks are fueled 
by the injection of mass and angular momentum at the stellar surface^. Neutron star spin periods 
in BeXs are typically 1 - 1,000 s, and BeX orbits are usually elliptical, with orbital periods ranging 
from about 10 - 1,000 d. Most of the accretion takes place during periastron passages, when the 
neutron star passes close to, or even through, the Be-star decretion disk. 



BeXs are exceptionally abundant in the Small Magellanic Cloud (SMC), where a burst of star 
formation ^60 Myr ago^ seems to have produced a large population of these systems'^. In fact, 
the SMC contains a comparable number of BeXs to the Milky Way (MW), even though the mass 
ratio of the two galaxies is about 1:100. By contrast, the number of BeXs in the Large Magellanic 
Cloud (LMC) is broadly in line with its stellar mass content when compared with the Milky Way. 



In the context of studying neutron star formation channels, it is useful to focus on well- 
defined, simple and "clean" populations of neutron-star-hosting systems (that is, systems in which 
the orbital parameters have not yet evolved since the supernova) that nevertheless span a wide 



1 



range of properties. BeXs can provide this. This is not only because the neutron stars in BeXs 
all have the same type of companion, but also because the accretion process itself seeems to be 
universal, with the neutron star spin in or near an equilibrium state in which the magneto spheric 
radius of the neutron star equals the Keplerian co-rotation radiu^^^^. This conclusion is suggested 
empirically by the location of BeXs in the log(Porb)-log(Pspin) plane (the Corbet diagram^^;Porb 
and Pspin are the orbital and period period, respectively), where they tend to lie along a line with 
slope a ~ 2 (Fig. 1). 

The correlation in Fig. 1 between Pspin and P^rh among BeXs is highly significant, but 
the data have large scatter. Despite this scatter, however, the 1-D projections of the data onto 
the log Porb and log Pspin axes both suggest that the BeX population might be bimodal. More 
specifically, the two sub-populations suggested by the data in Fig. 1 have characteristic periods of 
Porb — 40 d and Pspin ^ 10 s (short-period mode), and Porb — 100 d and Pspin — 200 s (long- 
period mode). The bimodality of the BeX population seems to be more prominent in log Pspin than 
in log Porb- This is helpful, because there are additional BeXs, not shown in Fig. 1, for which P^pin 
is known, but Porb is not. For the purpose of analysing the Pspin data on their own, we can therefore 
add these systems to the list of confirmed and probable BeXs. 

Such an analysis is shown in Fig. 2. It confirms that the log Pgpin distribution of BeXs con- 
tains two distinct sub-populations with characteristic spin periods of Pspin — 10 s and Pspin — 200 s 
and similar dispersions of about 0.4 dex. The short- Pspin and long-Pgpin sub-populations contribute 
about 35 and 65% to the total number, respectively. The split into these two sub-populations is 
highly statistically significant for the full sample, and remains significant even if the data set is 
divided by host galaxy. The split also remains significant if we consider only spectroscopically 
confirmed BeXs. Finally, the evidence for two sub-populations even remains significant if we use 
non-parametric statistical tests (which are less powerful, but more robust than the KMM algorithm; 
see the Supplementary Information for details). 

As shown explicitly in Fig 2 (bottom panel), the double-Gaussian decomposition of the in- 
dependent SMC and MW-i-LMC samples are consistent with each other. This makes it highly 
unlikely that selection effects are responsible for the observed bimodality. In any case, it is hard 
to conceive of a selection bias that would select specifically against BeXs with intermediate Pspin 
and/or Porb- We therefore believe that the two modes of the log(Pspin) distribution correspond to 
physically distinct BeX sub-populations. 

In principle, there are at least three ways to account for the existence of these sub-populations. 
First, they could correspond to two distinct neutron star spin equilibria that are accessible at all 
orbital periods. However, even though the bimodality is stronger in P^pin than in Porb^ the exis- 
tence of the Pspin — Porb correlation effectively rules out this possibility. Second, Porb might be 
time-dependent for BeXs, with the two sub-populations representing two distinct long-lived evo- 
lutionary stages. However, the time scale for stellar- wind-driven changes in Porb in the BeX phase. 
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"^Porb ~ ~ 1000 Myr (refs 14, 15), is substantially longer than the maximum duration of this 
phase, TBox.max ~ 20 Myr, the lifetime of an 8Mq star. Thus Porb evolution also cannot account 
for the two observed sub-populations. 

The third explanation that we consider is that the two sub-populations represent two distinct 
BeX formation channels. The most obvious possibility, with the farthest reaching implications, is 
that the two channels are associated with the two distinct types of supernova event noted above. 
More specifically, an iron-core-collapse supernova marks the end point of the evolution of any 
sufficiently massive star, whereas an electron-capture supernova can occur only under highly re- 
strictive conditions. In particular, an electron-capture supernova requires that the core reaches the 
critical density for electron-capture to occur, 4.5 x lO^g cm^ (ref. 16). These conditions might 
be met in the late evolution of intermediate-mass stars^ (those with initial masses satisfying 
8Mq < Minit ^ IOM0), although the relevant mass range is uncertain and may be quite small'^ 
However, it is much easier to meet the conditions for electron-capture supemovae naturally in 
binary systems". 

The outcome of electron-capture supemovae differs from that of iron-core-collapse super- 
novae in two fundamental ways. First, electron-capture supemovae should produce somewhat 
less massive neutron stars (^1.3 Mq) than iron-core-collapse supemova (1.4 Mq) (ref. 3). 
Second, electron-capture supemovae are expected to impart much smaller kicks to the neutron 
stars they produce (average kick velocity of < 50 kms~^) than are iron-core-collapse supemovae 
( > 200 kms"^)(ref. 17). In binary systems, where kicks induce orbital eccentricity, these differ- 
ences could naturally give rise to two distinct sub-populations. The more conventional iron-core- 
collapse channel would produce high-eccentricity binaries containing high-mass neutron stars, and 
the electron-capture channel would produce low-eccentricity binaries containing low-mass neutron 

starpinniini. 

If this is the correct explanation for the two BeX populations we have discovered, they should 
differ not only in Pgpin and Porb. but also in their characteristic orbital eccentricities, e. These 
have so far been measured for only about 20 BeXs. Figure 3 shows the distribution of these 
systems in the log Pspin — e plane. Even though there are only 8 such BeXs with Pspin ~ 40 s, it 
seems that long-Pspin systems have preferentially higher eccentricities than short-Pspin systems. 
The figure also shows the cumulative eccentricity distributions of the slow (Pspin < 40 s) and 
fast (Pspin > 40 s) BeX pulsar sub-populations. A Kolmogorov-Smirnov test shows that, despite 
the small number of systems for which eccentricity has been measured, the maximum difference 
between these distributions is marginally significant Supplementary Information). 

The two BeX populations that we have discovered are more clearly separated in Pspin than 
in Porb- Given that Porb does not evolve significantly within the BeX phase, whereas Pspin does, 
Porb might be expected Porb to be the more faithful tracer of the formation channels. However, 
the Porb distribution after the supemova must depend strongly on the Porb distribution before the 
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supernova, and both low- and high- velocity kicks can in principle produce a wide range of post- 
supernova orbital periods". This may explain why the bimodality is only marginally observed in 
the Porb distribution. By contrast, the equilibrium spin period is expected to depend on several 
system parameters other than Pporb (refs 10-12). If any of these parameters differ systematically 
between BeXs produced by the iron-core-collapse and electron-capture channels, Pspin may be a 
more reliable indicator of formation channel than Porb- 

Our results suggest numerous avenues for further research. First, it is important to expand the 
data base of BeXs with reliable measurements of Pspin, -Porb and e, to confirm and further quan- 
tify our findings. Second, if short-Pgpin BeXs are formed by low-kick-velocity electron-capture 
supernovae, they should have systematically smaller space velocities than long-Pgpin BeXs. This 
prediction might be testableP^. Third, short - Pspin systems should also have systematically lower 
neutron star masses than long-Pgpin systems EEMlEa jj^jg prediction might also be testable'^. 
Fourth, although our discovery of two populations of Be/X-ray pulsars is robust, our suggested 
explanation for their origin is clearly speculative and demands a fuller investigation. Intriguingly, 
recent binary population synthesis calculations have shown that the electron-capture supernova 
channel may be very efficient at forming BeX^. However, whereas the same population syn- 
thesis models also suggest that the electron-capture channel accounts for the overabundance of 
BeXs in the SMC^"^, the two BeX populations we have discovered appear to have similar relative 
abundances in the SMC and the Milky Way. 
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Figure 1 The Corbet diagram for high-mass X-ray binaries. The central panel shows logPorb 
versus logPgpin for neutron-star-hosting high-mass X-ray binaries. Filled circles correspond to 
spectroscopically confirmed BeXs, small dots to confirmed non-BeX systems and open circles to 
candidate BeXs. There are additional confirmed and candidate BeXs for which only Porb or Pgpin is 
known, but these are not shown. The dashed green lines mark a selection box that conservatively 
includes all confirmed BeXs for which Porb and Pgpm have been measured. Candidate systems 
outside this box are excluded from our sample of probable BeXs. The spin and orbital periods of 
confirmed and probable BeX systems are correlated. The Spearman-rank correlation coefficient 
\s p = 0.49 {p = 3 X 10"^; N = 66) for the full sample and p = 0.49 (p = 4 x 10"^; = 47) for 
the confirmed systems (see Supplementary Information for a definition of p-values). The scatter 
around the correlation is o-iogp^pi„ = 0.7 dex relative to the best-fitting line with slope a = 2 (solid 
black line). Different colours indicate different host galaxies: blue, Milky Way; red, MC; black, LMC. 
The histograms shown in the top and right-hand panels show the number of BeXs with spin and 
orbital periods in the respective ranges covered by the selection box. In each of these panels, 
the thick line corresponds to confirmed BeXs only, and the thin line corresponds to confirmed and 
probable BeXs. The vertical dashed line is drawn at Porb = 60 d, the location of the apparent dip 
in the log Porb distribution. This value of Porb corresponds to Pspm ~ 40 s (horizontal dashed line), 
which marks a more pronounced dip in the logPspin distribution. 




Figure 2 The log Pspm distribution of confirmed and probable BeXs. Top panel: Distribution for all 
systems. Middle panels: Distribution broken down by host galaxy: SMC (second from top); Milky 
Way (MW)+LMC (third from top). All of these distributions are bimodal, and the double-Gaussian 
decomposition suggested by the KMM algorithm'^is shown in each panel. The number of systems 
contributing to each observed distribution and the associated p-value provided by the algorithm 
are also shown. Applying the KMM test to the subset of spectroscopically confirmed systems (not 
shown) gives p = 8 x 10"^ (N = 64). Bottom panel: Direct comparison of the decompositions for 
the independent SMC and Milky Way+LMC populations, showing them to be mutually consistent. 
Additional details regarding the statistical evidence for the existence of distinct sub-populations in 
the Pspin data are given in Supplementary Information. 




Figure 3 The dependence of eccentricity on Pspin among BeXs. Left panel: Pspin versus eccen- 
tricity for all confirmed and probable BeXs with measured spin periods and eccentricities. The 
vertical dashed line marks the approximate division between the short-Pspm and long-Pspm sub- 
populations (Figs 1 and 2). Right panel: Cumulative eccentricity distributions of these two sub- 
populations. A Kolmogorov-Smirnov test provides marginal evidence for a difference between 
these distributions {p = 0.093), with the short- Pspin population being characterized by lower eccen- 
tricities (see Supplementary Information ior additional discussion). 
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1 Introduction 

In the main paper, we present evidence for the existence of two distinct sub-populations of Be/X- 
ray binaries (BeXs), which are most clearly separated in the log Pspin distribution. We argue that 
these two populations are most likely associated with different types of neutron star formation 
channels, with electron-capture supernovae primarily producing short- Pgpin systems, and iron-core 
collapse supernovae primarily producing long-Pgpin systems. This particular assignment is largely 
based on the eccentricity distribution of BeXs, which suggests that short- Pspm (long-Pgpin) systems 
are preferentially associated with low-e (high-e) binary orbits. 

Below, we provide supplementary information for several aspects of our analysis. First, we 
add some historical context to our discovery of two BeX sub-populations with distinct character- 
istic spin periods. Second, we provide additional details regarding the significance with which 
these sub-populations are detected in our data. Third, we discuss some subtleties regarding the 
interpretation of the BeX eccentricity distribution that help to properly assess the strength of the 
evidence for a connection between e and Pspin- We also show explicitly that there is no evidence 
for a connection between e and Porb- 

2 Historical Context 

The possible existence of a gap in the distribution between Pspm ^ 10 s and Pspm ~ 100 s was 
actually first suggested 30 years ago, although only 10 — 20 systems were known at the time^^ 
This suggestion does not seem to have been foUowed-up, however, perhaps because some later 
discoveries fell into this gap. In any case, to the best of our knowledge, the evidence for multiple 
sub-populations in the spin-period distribution has not been seriously reconsidered until now. 

3 The Statistical Evidence for Two Distinct Populations of BeXs 

We show in the main text that the evidence for two sub-populations contributing to the log Pspin 
distribution of BeXs is highly statistically significant according to the standard KMM test^^. The 
test statistic used by KMM is the likelihood ratio between the best single-Gaussian representation 
and the best double-Gaussian representation of the data. For the latter, we always find that the 
variances of the two Gaussians are consistent with each other. In our application of KMM, we 
thus adopt equal variances in the double Gaussian representation and then calculate the p-values 
robustly from bootstrap simulations^^. Here and throughout this paper, the quoted p- values have 
the usual statistical meaning, i.e. they represent the probability of obtaining a test statistic at least 
as extreme as the observed one, under the assumption that the null hypothesis is correct. In the 
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case of KMM, the null hypothesis is that the data are drawn from a single Gaussian. 

Figure 2 in the main text shows that the best double-Gaussian representation actually pro- 
vides quite a good match to the log Pspin data. It is nevertheless interesting to ask if we can find 
evidence for multiple sub-populations in our data set even without the assumption of normality. 
This can be checked by non-parametric methods, such as the "bandwidth test'^^. The standard 
bandwidth test is a non-parametric test for multi-modality based on bootstrapping from a kernel 
density estimate of the underlying probability distribution function (PDF). The statistic used in 
the test is the smallest kernel width that results in a unimodal estimate of the PDF. The standard 
bandwidth test is a strict test of modality, in the sense that the null hypothesis is only that the 
underlying PDF is unimodal, with no additional assumptions concerning its shape. However, a 
unimodal distribution with an obvious "shoulder", for example, may still be considered to provide 
strong evidence for multiple sub-populations. We can therefore also define an alternative test, in 
which the statistic used is the smallest kernel width that results in a kernel density estimate of the 
PDF that has only two inflection pointd^. Being entirely non-parametric, both of these tests are 
extremely robust. However, they are also much less powerful than KMM, mainly in the sense that 
their null hypotheses are so general that they are hard to rule out. The resulting values, which can 
be calibrated via Monte-Carlo simulation^, are thus always larger than those provided by KMM. 

We have applied both versions of the bandwidth test to the Pgpin data for confirmed and prob- 
able BeXs. For the standard (modality) bandwidth test, we first apply a skew-minimising Box-Cox 
transformation to the log Pspin data, because modes associated with distinct sub-populations can be 
either emphasised or suppressed by certain transformations^^. A transformation to symmetry is ex- 
pected to make any inherent bimodality (strictly speaking: "bimodalizability") easier to detecP^. 
For our BeX data set, the classic bandwidth test then rejects the null hypothesis of unimodality 
with p = 9 X 10~^ (application of this test directly to the log Pspin data would give p = 0.04). Ap- 
plying the alternative (inflection point) bandwidth test directly to the log Pspin data rejects the null 
hypothesis that there are only two inflection points in the underlying distribution withp = 1 x 10"^ 
(application of this test to the deskewed data would give p = 5 x 10^'^). Given the robust non- 
parametric nature of these tests, we consider these results to be strong evidence for the existence 
of two distinct BeX sub-populations. 

4 The Eccentricity Distribution of BeXs 

Figure 3 in the main text shows that there is marginally significant evidence that the eccentricity 
distribution of long-Pspin BeXs is different from that of short- Pspin systems. Some care has to be 
taken in interpreting this result. 

First, tidal circularization will always tend to reduce the eccentricity of close binary systems. 
However, the orbital periods within our BeX sample (Porb > 10 d) are too long for this mechanism 
to be effective-'*. Second, other things being equal, a given supernova kick velocity will induce 
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larger orbital eccentricities in wide, long-period, binaries, since such systems are more weakly 
bound. Given that Pspin is correlated with Porb in BeXs, this could produce a Fspin ~ e correlation 
in the observed sense. 

Does this latter bias affect our analysis? Supplementary Figure 1 shows the Porb versus e 
distribution for our BeX sample. Superposed on this, we also show the predicted locatiorpl, under 
the assumption that they all receive the large kicks typically associated with conventional iron-core- 
collapse ((fkick) — 450 km s^^). Two key conclusions may be drawn immediately from this plot. 
First, the observed distribution is clearly inconsistent with being drawn from a single, high-kick 
population. In fact, the low eccentricities (e < 0.4) seen in many BeXs with Porb ^ 20 d cannot be 
explained by large kicks at all. A possible connection between such low-eccentricity HMXBd^and 
low -kick electron capture supemovae has indeed been suggested before^^'^^. Second, the Porb — e 
plane does not separate the two BeX sub-populations as cleanly as the Pgpin — e plane. Indeed, if 
we split the BeX sample into short-period (Porb < 60 d) and long-period (Porb > 60 d) systems, 
their cumulative eccentricity distributions do not differ significantly (p = 0.35). Both conclusions 
indicate that the link between spin period and eccentricity suggested in the main paper is not simply 
due to a Porb -dependent bias. 

Finally, there are two obvious outliers from the trend that long-Pspin systems have preferen- 
tially high eccentricities (see Figure 3 in the main text). These systems are X Per and lA 1118- 
616, which actually have the longest spin periods (Pspin = 837.7 s and 407.7 s, respectively), but 
e ^ 0.1. Both of these object are known to be highly unusual within the BeX class. For example, 
X Per does not show the transient X-ray bursts associated with periastron passages that are the 
hallmark of almost all BeXs, while lA 1118-616 is one of the furthest systems from the standard 
Porb — Pspin Correlation (Figure 1 of the main text). Indeed, in both systems, the dominant accretion 
mode probably differs from that which establishes the spin equilibrium in other BeX^I^. This 
may explain why P,pin is not a valid tracer of formation channel in these two objects. Also, since 
lA 1118-616 has a surprisingly short orbital period considering its slow spin (Porb = 24 d^*^), 
the low eccentricity of this system is not actually in terrible conflict with a standard (high) kick 
scenario''^. If X Per and lA 1 118-616 are excluded from the eccentricity distribution of long-Pgpin 
systems, the difference to the short-Pgpin systems becomes highly significant {p = 0.006). A poste- 
riori outlier removal is, of course, a dangerous statistical practice, so this p- value cannot be taken 
quite at face value. Nevertheless, it is also interesting to note that all other high-P^pj^ systems in 
Figure 3 do, in fact, lie in the region predicted for systems produced via high-kick supernovae. 




Supplementary Figure 1 The dependence of eccentricity on Porb among BeXs. The 
left panel shows Porb versus eccentricity for all confirmed and probable BeX systems with 
measured spin and orbital periods and eccentricities. Long-Pspin systems with Pspin > 40 s 
are additionally marl^ed with open circles. The vertical dashed line marl^s the approxi- 
mate division between the short-Porb and long-Poib sub-populations (see Figure 1 in the 
main text). The shaded area shows the region predicted to contain 60% of systems by 
population synthesis calculations adopting a l^icl^ velocity distribution appropriate for iron- 
core-collapse with mean (t;kick) - 450 km s"^. The top right panel shows the cumulative 
eccentricity distributions of these two sub-samples. A Kolmogorov-Smirnov test provides 
no evidence for a significant difference between these distributions {p = 0.35). 
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